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0S ANMETHRACT 


The purpose of this thesis is to determine the power re- 
lationship, through computer simulation, between the paranet ric 
ANOV and non-paranectric Wilson tests under controlled condi- 
tions of crror non-normality. 


Data is simulated using the 12 cell factorial ANOV modeci 
with three levels of factor A, four levels of factor B, and 
six observations per cell. Interaction is characterized such 
that its effect is proportional to the effect of factor A with 
the constant of proportionality related to fuctor B. Non- 
normality of the error term is characterized in three distri- 
bution types: skewed, leptokurtic (pe sked), and platykurtic 
(flat). Four degrecs of the three err r distribution types 
are utilized, cach related to the Pearson camily of frequency 
curves. 


Three thousand-seven htndred sets of data are gencrated 
for cach degree of crror iype. Power is then estinated di- 
rectly for both the ANOV F tests and Wilson Chi-square tests 
for main effects and interaction. Couparison is then made 
between corresponding tests showing the effect of error non- 
normality on the power of each. 
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ABSTRACT 


| The purpose of this thesis is to determine the power 
relationship, through computer simulation, between the 
parametric ANOV and non-parametric Wilson tests under con- 
trolled cencitions of error non-normality. 

Data is simulated using the 12 cell factorial ANOV 
model with three levels of factor A, four levels of factor 
B, and six observations per cell. Interaction is charac- 
terized such that its effect is proportional to the effect 
of factor A with the constant of proportionality related to 
factor B. Non-normality of the error term is characterized 
in three distribution types: skewed, leptokurtic (neaked), 
and platykurtic {flat}. four degrees of the three e:sus dis- 
tribution types are utilized, each related to the Pearson 
family of frequency curves. 

. Three thousand-seven hundred sets of data are generated 
for each degree of error type. Power is then estimated di- 
rectly for both the ANOV F tests and Wilson Chi-square tests 
for wain effects and interaction. Comparison is then made 


between corresponding tests showing the effect of error 


“non-normality on the power of each. 
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I. INTRODUCTION 


A. PEARSON'S STUDY OF NON-NORMAL VARIATION 
The normality of error assumption is a well known re- 
quirement when using Analysis of Variance (ANOV) techniques. 
It has been shown, however, that this requirement is not 
stringent when only type I error is the point of concern. 
The iieevatuie does Suggest that error normality is a re- 
quirement relative to type II error and correct determination 
of power of the test. | 
Pexrson [1] studied error non-normality for a case in- 
volving the one way ANOV classification. His study was 
based on the distribution of the correlation ratio n? which, 
as used, wes cquivaient io ihe F stulistic; nt ve disse 
tributions were chcsen with non-normality of error charac- 
terized by Pearson coefficients B, and B,? and by Pearson 
curve type*. Those chosen were: . | | 
RB, = 0.0 B, = 2.50 (Type II, symmetrical ceo 
B, = 0.0: 8B, = 4.10 (Type VI, symnetrical icptokurtic), 
B, = 0.0 6, = 7.05 (Type VII, symmetrical leptokurtic), 
B, = 0.2 6, * 3.3 (Type II1, skew), 
B, = 0.49 B, = 3.72 {Type ill, skew), 
8, . 0.99 B, = 3.83 (Type Cees skew with abrupt Start). 


Pearson concluded that the distribution of n?, and therefore 


1 Reference 2- defines ‘these coefficients - as 8, * viva, 
B * w*/us where j Teprescnts the ith central moment. 


. 2 Reference 3 transforms the 13 curve types of Reference 
2 into pro>ability density Functions. 


7% 


of F, within the range of the above six distributions ade- 


quately met the "normality of error" requirement for Analysis 


of Variance. He further concluded that within this range 

| there would be little chance of rejecting a true null hypoth- 

esis because of non-nornality, but that in the extreme cases 
i of non-normal variation there would always be a danger of 


accepting a false null hypothesis. Thus, Pearson suggested 
thet extreme cases of error non-normality may result in a 


reduction of power in the F test. He thereby raised the ques - 


<i ts ebt—s i: once taal aa Tiedt SEL tea ARIE BSAA Zang! heme 


tion as to how error non-normality would affect the F test 
power for a particular ANOV design. 

Kirk [4], citing the above study by Pearson (1931) and > 
a study by Norton as reported by Lindquist [5], extended — 
Pearson's conclusion relating to type I errov to ail fixed- 
effects ANOV models utilizing the F distribution. Kirk held 
that, in general, unless the departure from normality is so 


extreme that it can be readily detected by visual inspection 


EIEN A CERES ECO NE STM ISS ETT rp 
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of the data, the departure will have little effect on the 
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probability associated with the test of significance (type 
I error) However, Kirk made no reference to the effect of 


non-normality on the power of the test. 


B. A NON- ~ PARAMETRIC ANOV TEST 


chica Ae aici ORE Oe 2 ei Ninepacen ge,  k 


The problem of. meeting the error normality assumption. 
; for using the parametric ANOV P tests is avoided by choosing 


a non-parametric test. A search of the literature was made 


to find such a test which included a non-parametric method 


for testing interactions in the ANCV model. Wilson [6] 


PRS derinneNs © prota siptente aR IonemiRMnRE, Eesetitar setIOE eli nt Nia inantanes cite QO OME EMRE a 
, . 


developed such a test, based on the Chi-square distribution, 
for testing hypothesis in two-way, three-way, up to n-way 
ANOV designs. The test procedure, applicable only to fixed 
effects models, involves classifying the scores in each 
cell as above or below the overall median and using the fact 
that a total Chi-square, like assim of squares, can be de- 
composed into additive parts. A new set of formulas are in- | 
troduced for working with factorial designs in. Chi-square 
terms. A description of the test including Pie Chisauaxe 
formulas is contained in Appendix A. | 
Sheffield [7].showed how the Wilson test could be con- 


verted to a conventional ANOV procedure, whereby the Wilson 


‘Chi-square formulation could be replaced with "non-parametric 


FY tests. 

McNemar [8] contrasted ‘the outcones. of the Wilson test 
and the parametric F test on seven batches of data, each in- 
volving two-way classification. Based on 21 comparisons (A, 
B, and AxB effects) he suggested that the power of the Wilson 
tests was lower than what could be reasonably expected. How- 


ever, McNemar failed to determine what constituted "yeason- 


- able" power for the Wilson test. In addition, he based his 


conclusions on a small sample which he admittedly assuned 


met the: parametric ANOV requirements of normality. .and hono- 


scadacity. To the author's knowledge, the literature does 


not report any other attempt to ovtain an indication of power 


_ for the Wilson Chi-square tests. 


a 


C. YPE OF INTERACTION 

Williams [9] studied the problem of interpreting the 
Bt effects of different factors when those effects are not ad- 
ditive. He pointed out that attention must be paid to the 
way in which the factors interact. He suggested that a rea- 
sonable assumption, in the two factor case, was to consider 
that the interaction effect was proportional to the effect 
of one factor with the constant of proportionality related 
to the second factor (i.e., (0B) 55 = a5C5). Williams 
raised the question as to the effect of the type of inter- 
action on power of the tests for hypotheses ccncerning main 


effects as well as interaction in the factorial ANOV design. 


~D. . POWER OF THE TEST 
Power of the test is defined as the probability of re- 
jecting a false nuli hypothesis. Power functions have been 
"developed for parametric tests, since these tests are based 
on assuned known distributions. Power curves for parametric 
ANOV tests are contained in the Appendix to Reference 10. 

One of the arguments for dntering these curves is $, a func- 
tion of the factor non-centrality parameter and error vari- 
ance. Formulas for obtaining the argument 4 for ANOV designs 
involving interaction are contained in Re ference ll. | 


Sicgel [12] proposed that the power of non-parametric 


tests car be expressed by comparison with the most powerful 
a _ @Xisting parametric test that is used for the same purpose 
b. since no power functions exist for the "distribution-free" 


non-parametric tests. Sicgel pointed out that the more 


Pe 8 
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general the non-parametric test (the fewer the assumptions) 
the less powerful the test will be in comparison with a 
parametric test involving the same sample size. He further 
stated that the F test, because of its strong assumptions, 
is the most powerful test of its type. Siegel called his 
power comparison concept "power efficiency." It is a func- 
tion of the increase in sample size of the non-parametric 
test over that of the parametric test which is necessary 

to make the two tests equally powerful. 

Theoretically, the power of a statistical test, para- 
metric or non-parametric, can also be estimated empirically 
through computer simulation. <A simulation model for a par- 
ticular ANOV design can be constructed and data generated 
so that the desired factor und interaction eftects ar 
ent in the data. The null hypothesis, that the "built-in" 
effect is not present, is then tested at a desired leve] of 
significance. Power at the desired level of significance 
is measured by taking the ratio of the number of times the 
false null hypothesis is rejected over the number of times 


the test is conducted. . 


E. PURPOSE OF THESIS 

The purpose of this thesis is to develop a computer 
simul ation model for obtaining a power comparison between 
the parametric ANOV F tests and the non-paranctric Wilson 
Chi-square tests under varying conditions of unimodal crror 
non-normality. It is envisaged that che general method will 


be applicable in determining the power of other non-paramet- 


ric tests. 


CNP Ee 


li, METHOD 


A. ANOV DESIGN AND GENERAL PLAN 

A 3x4 ANOV design with six replications per cell was 
chosen because of its general nature and because it was de- 
sired to compare interaction detection capabilities of the 
Wilson and F tests, as well as that of main effects. The 
method developed is presented in five parts: characterizing i 
error, characterizing interaction, the data simulation model, : 
the computer program, and determination of test replications 


for desired confidence. 


b. CHARACTERIZING ERROR 
Only unimodal error was considered. Error was charac- 

terized as skewed, leptokurtic, and platykurtic. Four degrees 
of each type were eoisidered’ from violent to almost normal. id 
The degenerate case for each error distribution type was the | | 
normal error distribution and this was considered as a fifth 
degree. Rach degree was identified by Pearson coefficients 
B, and 8,. For continuity with Pearson’s work, some degrees 
were chosen identical to the error distributions studied in 
Re ference 1. All error distributions were selected with a 


mean of 0 and a variance of 4. 


C. CHARACTERIZING INTERACTION 


Interaction was characterized as follows: 


(a8) 5 = a ee jell : (1) 


10. 


Sere npereg kth deamon anwar nom tere MEAN ners ne atthe; cosine 


the effect of factor A, a., and increasing with each level 


where c.'s were constants, evenly spaced, increasing and 


J 
te. = 0, (Note that the "evenly spaced" and "increasing" 


jj 
features are not required.) This characterization resulted 


in the interaction effect, (a8) 55, being proportional to 


1 


of factor B. 
| 


D. THE DATA SIMULATION MODEL | | 


1. The Mathematical Model 


ae 


ijk 7 Ht 4g + By + COB)S5 + 255y (2) 


i= 1,°°,3; j = 1,°°,4; k = 1,°°,6. 


2,5, Tepresents the error which under the ANOV assumption is ; 
distributed N(0,02). It was the distribution of this error 
term that was changed from normality, but with a constant 
variance of o* = 4, in order to determine the effect of non- 
normality on the Wilson and F tests. rhe development of 
equation (2) into the twelve cell models shown below is con- 
tained in Appendix B. 

- Model for cell one: 


York =U * 9% + By + COB)Q) * 279% k=i,**,6. (3) 


Model for cell two: 


Yu2k ite oo lade 5 Re! (4B) 45 * 2)0x k=1,°*,6. (4) 


_. Model for cel) three: 


Y 3k "uta, + Bs ¢ (oB)y3 * 25, k=#1,°°,6. (5) 


Model for 


cell four: 


Yuqk 7 tO > By - By - By - (a8)a, - (08),, 


~ (OB) 15 * Zh 4x 


Model for 


Your 7 H+ a, 


Model for 
ake > ee 
Model for 
gig WOES 
Model for 


Youk = HT 


cell five: 


+ By + (oB)oy + Zax 


cell six: 


* By + (08)o2 * 22% 


cell seven: 


* Bs + (08)25 * Zo5y 


cell eight: 


oe eek oe 


- (0B)o5 + Zo4y 


Model for 


York 7 U7 Oy 


* 231k 
Model for 
Yuan ™ Ho Oy 


* 232k 


Model for cell eleven: 


Ys3q 7 ¥ > oy 


* 233% 


cell nine: 


k=1,°°,6, 


k=1,°°,6. 


k=1,°°,6. 


k=1,°°,6. 


33 = (eB) 54 ia L48 Joo 


k=1,° ® »6. 


a + By a (0B) 4, a (0B) 24 


cell ten: 


k=1,°*,6. 


a2. * By - (aB)i2 > (a8) 22 


k=) ,°*°,6. 


"@) + Bs - (08))5 - (08)25 


12 


k=1,°°,6. 


(6) 


(7) 


(8) 


(9) 


(10) 


(12) 


(12) 


(13) 


ree seecerwnetaents saute piacentatiaiLerionn. “teed 
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Model for ceil twelve: 
Ygaq = Uv Gy 7 Og - By - By - Bs * (OB)y) + (08)q, 
* (08),5 + (B)91 + (a8) 22 + (aB)25 + 2344 
ke1,°*,6. (14) 


Procedures for generating error variates 244k and 
for determining the parameter values Oty» Oo s°**, (0B) 52 are 


given below. It may be noted that several of the parameters 


cf equation (2) have dropped out of the twelve cell models 
as explained in Appendix B. 


2. Generation of Error, 254k 


oe bid 


a. Normal Error Distribution i 
Normal error varvjates, with waan of 0 and vari: 
ance of 4, were generated by using the central limit anproach 
contained in Reference 13. <A normal variate z was made by 


applying the following simulation formula: 


, Xk 
ice <i _k 
aes ea i: | * uy (15) 
. j=] ; 
where: 
iS? desired mean. of the normal error veriate (here=0), 


06°" desired standard deviation of the normal error 
variate (here=2), 
yr = a Uniform (0,1) random number, and 
k « number of random variates desired for approximating 


a normal variate by the central limit approach. A value of 


BEM 


k#12 was chosen for convenience which reduced (15) to 
12 | 
220, ). x, - 6.0). - (16) 
i=l 
b. Skewed Error Distribution 
The mcst violent degree of skewed error was an 
exponential distribution and was generated using the expo- 
nential generator shown in Reference 13. Advantage was taken 
of the fact that 


1 \ 
r= exp [- —2z], 
u 
z 


and 

ze U log r, (17) 
where z in this case is an exponential variate with desired 
mean of yw, and desired variance of oF = ue = 4, The variate 
z was then transformed by the below formula to obtain a mean 


of Q. 


The other three degrees of skewed error were 
generated with the gamma (erlang) generator of Reference 13. 


The simulation formula was developed from the probability 


density function (pdf) 


k z (k-1) ez 


£2) * “en 


where a > 0, k is a positive integer, and z is a non-negative 


(18). 


erlang variate. Here 


14 


and 


ke#=—, 


Erlang variates were made by taking the sum of k exponential 


variates, 


k 
»-i) 
BO log Ts, 


i=] 
which is equivalent to 
k 
ze - “ log TTr, (19) 
i=1 
Since M, had to be greater than 0 in the simulation equation | 
(19), the below transformation was used to transform the 
Vaviste 2c that -the:-resul cane -errer ‘distribution fad 2 moan ! 
of 0. 
ee Pe | (20) 


The three combinations of a and k chosen to generate the 


three levels of erlang variates with mean 0 and variance 4 


+ 


were, 

| a k 
- 2.0 16 
7 1.0 4 
og 
; v2 /2 ra 


c. Leptokurtic Error Distribution 
The four degrees of leptokurtic error were gens. 
erated by sampling from an empirical cumulative distribution 


function (cdf). The below pdf cquation for a type VII 


1S 


2 ' hid te a alata 7 - 3 
le ne te ee ee 
7 . . 
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Pearson curve (leptokurtic) was obtained from Reference 


3. 


1 x? -m ; 
= + mee 
> ane 3 (21) 
where 
5B, - § 
m= — aig 
B. yee 
2, 2u2B2 ; 
a‘ = B73 and yy = desired variance = 4, 
y,.* NN. fa) and N = number of desired 
alt I (m-%s) variates in the 
distribution 
and 
x = abscissa index value for the associated 


ordinate density value, y. 

One hundred index values from -12.C to + 12.0, six standard 
deviations, were used in making an empirical pdf by computer 
Simulation. A routine was incorporated into the program that 
calculated the mean, variance, unbiased estimates 8 and Bos 
and frequencies at the 100 index points (i.e., histogram 
frequencies). This was done to insure that the error gener- 


ated was accurate. The pdf was converted to a cdf by succes- 


sively summing the pdf values along the same horizontal axis 


(-12.0 to +#12.0). A Uniform (0,1) random number was then 
generated and used to sample from the enpirical cdf. A 


7 binary search technique was incorporated in the computer pro- 


gram at this “sampling" stage to reduce computer time. The . 


error variate was determined by interpolating between the. 


two index points on the horizontal axis which bracketed the 


16 
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probability value from the cdf that corresponded to the sam- 
pling random number. 
d. Platykurtic Error Distribution 
The four degrees of platykurtic error were gen-. 
erated by a similar empirica] cdf sampling method. The pdf 


equation determined from Reference 3 for a type II curve was: 


; ova 
Y"* Yo : ar (22) 
where 
5B, - 9 
mh = - 
. 2(3-B5) 
2u, B 
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A similar routine for calculating mean, variance, By» Bos 
and histogram frequencies was also included in the computer 
program for platykurtic error (and also for skewed and nor- 
mal error) to insure thut the results were as desired. Se- 
lected computcr programs are attached following the appendices! 
to this thesis. Smooth histogram curves for the degrees of 
each error type are shown in Appendix C. 
3. Model Parameter Values 
The Wilsor test was the first claimed distribution 


free method for testing interactions in the ANOV model. The re- 


‘fore, determining the power of such an interaction test was 
a primary consideration, It was desired to chose parameters 


for the 3x{i model (equations (3) through (14)) which would 
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guarantee that the power of the parametric F test for in- 
teraction would be in a-sensitive range on the ANOV power 
curve. After studying tne appropriate power curve, shown 
in Appendix D, a desired interaction test over of .67 was 
chosen. This gave a $ (curve argument) value of 1.3. Work- 
ing backwards from this, subject to the conditions given in 


Appendix B and the chosen interaction characterizaticn, the 


below parameter values were determined for equations (3) 
through (14). See Appendix D for development and computa- 


tions. 


Bo 
u 


100.00 .. 
a, = -1.03 
a = 0.28 


: z By = -1.03 
B, = -1.03 
B, = 1.03 


(oB),, = 1.4832 
(a8). = 6.4944 
(08),5 * 0.4944 
(a8), =. -0.4032 
(0B). = -0.1344 

(aB),3 = 0.1344, 


EB. THE COMPUTER PROGRAM 


The gencration of error variates was incorporated into 


4 


is ied” i gah cision ocote etter ithaendinn areata are, 


_ the main computer program. A different "package" was written 


for cach degree of each error distribution type. Six different 
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replication of error variates were then applied in each of 
the 12 cell models to produce 72 observations for testing 
by the parametric ANOV F tests and the Wilson Chi-square 
tests. 


Separate subroutines were written for the parametric 


ANOV and Wilson tests. F and Chi-square statistic values 
were computed for main and interaction effects. 

In the main program these computed values were compared 
to the threshold values from the F and Chi-square tables at 
the 5 per cent level of significance: If the calculated F 
or Chi-square value was greater than or equal to the thres- 
hold value, the false null hypothesis was rejected and 
“counted."' The total count was then divided hy the number 
of times the ivsi wus cunducted te obtain the power of the 
test. | 

Flow charts of the main pregram, subroutine ANOV, and 


subroutine Wilson are shown in Appendix E, 


F, DETERMINATION OF TEST REPLICATIONS FOR DESIRED CONFIDENCE 


“Stein (1945) showed how a sequential sampling procedure 


SE RAMEIT AE cL setercee He Stren te Atel mae SION EIST Latent MERA 27’ ste RNS CINE SEN EONAR CLES —TG AORRY O: LAREN NO 
‘ aa . ‘é A ‘ 


could be used for establishing a confidence interval of fixed — 
length e for estimating the mean y having a confidence coef- 
ficient > 1-a. | | | 
This procedure was used to establish the number of test 
replications (N) necessary to obtain at least 95 per cont. | 
confidence of being within 20.02 (e = 0.04) in determining 
power of the test for the three F tests and three Chi-square | 


tests. 
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Stein's theorem and proof are contained in Reference 
14. Application of the theorem to obtain N is shown in Ap- 
: pendix F, The computer program utilized is attached follow- 
ing the appendices. The result was N = 3700. This was the 
maximum of the replications required for the six tests. 
The deciding test was the Wilson Chi-square test for the A 


(row) effect. 


_ eee ey 
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III. VALIDATION 


A. DETERMINATION OF THEORETICAL POWER, ANOV TESTS 
As stated above, a desired power of 0.67 for the ANOV 
interaction test was used in determining the simulation 


model parameters. Using the below equations from Reference 


1l, x P 
ne. a: 
on = SE Sat and (23) 
or 
E B2 
nr . 2 
o3 = tt > (24) 
orc 


and substituting applicable parameter values, theoretical 


powers for the testing the A and B main effects by ANOV F i 


tests were founce to be U.S: afd 9.96 respectively. 9 cal- 
culations and the applicable ANOV power curves are contained 


in Appendix G. 


B. ANOV POWER SIMULATION RESULTS, N(0,4) ERROR 
Three thousand-seven hundred data cases with N(0,4) 
error and the predetermined model parameters were tested 


with the ANOV subroutine. Power was cc puted and compared 


with theoretic] power. 


EFFECT OA B AxB 
Theoretical W/ 0.81 0.96 0.67 
N(0,4) Error 
Simulated W/ | 0.808 0.960 0.665 


~N(0,4) Error 


21 


bie A wi dnk a ile tam ee 


C. ACCURACY CHECK OF WILSON SUBROUTINE 

i 

[ One hundred data cases with N(0,4) error and predeter- 
mined model parameters were tested with the WILSON subrou- 


tine. Printout was made on median calculation, contingency 


table, and Chi-square values (total, A, B, AxB). Five of 


the cases were then selected at random and the same values 


ie lA PI ARNE ami! UI thie BA SCRE ci iE aK, Migs tS 8 8 i aah ccs MRA gn ice Saal TSA 


were calculated by hand. Results wers exactly the same. 


Se 


Dd. SIMULATION OF LEVELS OF SIGNIFICANCE 
Parameters of the simulation model, other than the over- | 


all mean, were set equal to zero. Then 3700 data sets were 


produced with N(0,4) error and tests conducted with the ANOV 
and WILSON subroutines. Based on this sample size the true 
level of significance was estimated within plus or minus 
0.02 with 95 per cent confidence. That is, by choosing F 
and Chi-square threshold values for rejection at the 5 ard 
10 per cent significance level, the chance percentage of 

| rejections should have been equivalent, within the criteria, 


to 0.05 and 0.10 for each of the ANOV F and Wilson Chi-square 


tests. Results were: 


Egat ESE pe mh ee, nie may 


§ PER CENT REJECTION THRESHOLD | | : 


EFFECTS oA B AXB 

ANOV F 0565 .0535 —«-. 0808 

WILSON x? .0586 «=©.0486 = —s«w 042 se af 
| 10 PER CENT REJECTION THRESHOLD 4 

EFFECTS A B AxB 

“ANOV F 1051 .2089Ss« 0985. 

WILSON x2 «0784. 0884 0954 
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iV. RESULTS 


Three thousand-seven hundred data cases for each of the 
four degrees of skewed, leptokurtic, and nlatykurtic error 
were tested with the ANOV and WILSON subroutines. Results 
are given below in three tables. Each table gives the 
power of the test for the three effects hypotheses with a 
particular type error. Pearson coefficients are shown for 
reference, as well as the actual means and variances of the 
error distributions generated. The last column of each | 
table gives the power of the test with N(0,4) error for 
convenience of comparison. Error degrees shown correspond 


to the curves of Appendix C. 
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ERROR DEGREES 


i. ¢ 2 3 4 a) 
ERROR Bo"9.03 ,=5.96 B.*4.57 B=3.47 B,=2.95 
DIST'N (EXPON) ___(NORMAL) 
COMPUTED DIST'N 
MEAN 0.00 0.00 0.00 0.00 0.00 

COMPUTED DIST'N . 

viReance 3.97 3.98 3.98 3.97 3.98 
Te eae ae a ee ee 

. 2 
Ha! z CH #0 | 
POWER ANOV 0.82 0.81 0.82 0.82 0.81 
POWER WILSON 0.74 0.64 0.58 0.51 0.47 

: eae 

. 2 
1 Rs 1 a eee a ee TE Se 
POWER ANOV 0.95 0.95 0.96 0.96 0.96 
POWER WILSON 0.97 0.91 0.85 0.78 0.74 


tt ee LO A RS PDS RPO AEST, RSET SE ENTE, 


Hy? BE (a8),, = 0 


fe) ‘ : 
Hy: LE (a8), # 0 | ‘ae 
POWER ANOV 0.68 0.67 0.66 0.66 0.66 — 
POWER WILSON -_ 0.36 0.30 0.27 0.25 . 0.25 


Sete LeN INE y SORENESS “ROAR a te AEE CARMONA OR re ER TREE > ee eR RPO PE betsy 2ye seize 


- eAacalles epote aay me 


ERROR DEGREES 


1 2 5 4 5 
LEPTOKURTIC B,=0.00 6,=0.00 B,=0.00 B,=0.00 B,=0.00 
ERROR B,=20.00 8477.95 B,=4.10 Bo=3.40 B.*2.95 
De EN ee ee eh ee Best 35 a (NORMAL) Us. 
COMFUTED DIST'N . 
MEAN | -0.10 -0.11 -0.12 -0.12 ).00 

COMPUTED DIST'N = 

VARTANCE 3.86 3.93 3.96 3.97 3.98 7 
To. 2. 
Ho: D> Or 0 

: 2 
Hy: 5 as # 0 = _ 
POWER ANOV 0.82 0.81 9,81 0.81 0.81 
POWER WILSON 0.56 __0.53 0.50 0.48 0.47 
eared ES aa PN Tat Wier geen ah ee ree Wace OR cet LANE NS UAT AIS TR NE EROS REIL 
Ho: Py B 0 


Hy: 2 B2 # 0 . 
A SECA TELA Reet cee ne ee Se LE ee ONS Ie VERE RAE, NOS oe eee eT 
POWER ANOV 0.96 0.95 0 96 0.96 0.96 


POWER WILSON 0.86 0.82 0.78 0.75 0.74 


Ha BE (a8),; = 0 
Wai TE (08),5 # 0 


POWER ANOV _ 0.68 0.67 0.66 0.66 0.66 
POWER WILSON -6.30 0.29 | 0.26 0.25 - 0.25 
25 


ERROR DEGREES 


1 2 3 4 5 

PLATYKURTIC B,70.00 8,=0.00 8,=0.00 8,=0.00 8,=0.00 
ERROR B "1.80 85*2.00 B,=2.25 8)=2.50 8,=2.95 
DIST'N (UNIFORM) _ (NORMAL) 
sae DIST'N 0.90 -0.03 -0.05 -0.06 0.00 
COMPUTED DIST'N 

eee: 3.97 3.97 3.97 3.97 3.98 

2 an 

Ho z a; = ¢ 

é 2 
Hy: + as # 0 
POWER ANOV 0.81 0.81 0.82 0.82 0.81 
POWER WILSON 0.37 0.40 0.42 0.44 0.47 
Hy: 5 Bf = 0 

. 2 
Hy: = 83 #07 
POWER ANOV 0.96 0.96 0.96 0.96 0.96 
POWER WILSON 0.55 0.60 0.65 0.68 0.74 


H,: rz (08) 55 = 0 
Ha: rt (OB); # 0 


POWER ANOV 0.65 0.65 0.65 0.55 0.66 
POWER WILSON 0.18 0.20 0.21 0.23 0.25 
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V. ANALYSIS AND INTERPRETATION OF RESULTS 


A. HYPOTHESIS TEST POWER CURVES 

Figures 1, 2, and 3, which cre attached following this 
section, compare the power of the ANOV F and Wilson Chi-square 
tests by the null hypothesis being tested. Each figure con- 
tains three sets of curves, one for each error distribution 
type. The horizontal argument of "error degree" corvesponds 
to the degree of non-normal error distribution type shown in 


the distribution curves of Appendix C. 


B. ANOV F TESTS 

Figures 1 through 3 show chat tor a 3x4 factorial with in- 
teraction and 6 observations per ceil the power of the F test for 
main effects and interaction effects hypotheses are unaffected 
by the thee types of error non-normality. The range of power 
change from the esvimated power with normal error is only 
- 0.01 to + 0.02 over all three hypotheses. It appears that 
there should be no concern relative to dupeadetion of the 
power of the F test beciuse of error non-normality, even in 


extreme cases, aS long as the data is unimodal. 


C. WILSON CHI-SQUARE TESTS 


The below curves indicate that the Wilson Chi-square 


BERT SNR RES me ee EME ATER Coen ype amit 


tests are not "distribution free" as claimed, The power of 


the tests for main effects and interaction effects are sen- 


em cwercmnee tae 


sitive to the shape of the error distribution. 


The normal crror power estimate for the Wilson A effect 


ema agetatent pe 


test was 0.47. Non-normal error ir that the test resulted 
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in a range of power change of -0.10 (violent platykurtic) to 
+0.27 (violent skewed) from the normal value. For the B ef- 
fect test the range of power change from the normal error 
estimate of 0.74 was -0.19 (violent platykurtic) to +0.23 
(violent skewed). For the AxB effect test the range of 
power change from a norm of 0.25 was -0.07 to +0.11, with 


the same error type extremes as before. 


In general as expected the power of the Wilson Chi-square 


tests were lower than the power of the comparable F tests. 
When the error distribution was N(0,4) for both, the power 
of the Chi-square test for A effect wns 58 per cent of the 
power of the comparable F test, the power of the Chi-square 
test for B effect was 77 per cent of the power of the F test, 
and the power of the Chi-square test ior Axb effect was 38 
per cent of that of the F test. However, there was one case 
where the power of the Chi-square test was higher than that 
of the F test (0.97 versus 0.95 in he wese for B effect un- 
der the condition of violent skewed error). 


The error effect trend for a particular error type was 


consistent over the three hypotheses tested. Note the simi- 


larity in shape, for example, of the skewed error Wilson 


power curve in Figures 1 through. 3. The rate of change in 
power with respect to error degree (as error varied from 
normality) was, however, significantly less for the Chi- 
sque.c test for inteyection-under the three error types than 


it-was in the Chi-square tests for both main effects under 


“equivalent error types. 
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VI. DISCUSSION 


6 UR lady deal ash 


A, WILSON TEST SHAPE SENSITIVITY 


{t is apparent from Figures 1 through 3 that the dis- 


A aaa Diabet dle 


tributions of the Wilson Chi-square test statistics under 


cai an he 


the alternate hypotheses are sensitive to the shape of the 


error distributicn. The question arises as to whether the 


distributions of these statistics under the null hypotheses 
are also sensitive to the shape of the error distribution; 

that is, whether the type I error (alpha) is changing with 

each degree of error distribution. 


To investivate this, level of significance estimates 


were made under the four degrees of skewed error distribu-- 


tion by determining power with the muodei cuefficients set 


} , equal to zero and Chi-square threshold values set for 
alpha = 0.05. Results were obtained and compared to level 


of significance results with N(0,4) error. 


A B AxB 
WILSON SKEW #1 .0624 .0524 =. 0489 
WILSON SKEW #2 .0659 0530 = .0432 
WILSON SKEW #3 0659 .0530 0432 
WILSON SKEW #4 = .0649 .0524 = 0454 


WILSON N(0,4) 0586 .0486 0411 


Although the true ievels of significance were slightly higher 
under the skewed error, they were higher by approximately the 


same amount in all cases. On this basis it is concluded that 
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the distributions of the Wil: n Chi-square statics are only 
sensitive to the error distribution under the alternate hy- 
potheses. 

Since the test as proposed by Wilson uses the Chi-square 
as the approximate distribution oi the test statistics it is 
evident that this approximation does hold under the null 
hy»othesis (at least in the right hand tail of the distribu- 
tion) but that the degree of approximation of the non-central 
Chi-square under the alternate hypothesis is sensitive to 
changes in the error distribution. Sawrey [15] has labeled 


such tests as “semi-nonparametric." 


B. USE OF THE WILSON TEST 

The above should not be interpreted to mean that the 
Wilson test is invalid and should be avoided. It is obvious- 
ly inferi r to the ANOV F test and should not be chosen when 
the data permits utilization of the parametric ANOV. How- 
ever, the Wilson test, to the author's knowledge, is the 
simplest of only two procedures applicable to testing for 
interaction when the data is qualitative and measurements have 
been obtained only on an ordinal scale (see Mood [16] for the 
other). It should be used in such cases with the realization 
that its accuracy depends upon the shape of the underlying 
error distribution. After histogramming the data, the re- 
sults of this paper may be used to provide general insight 
relative to the question of what power to expect. Pertinent 
power estimates for a particular design may be obtained by 


employing the computer simulation method presei.ted herein, 
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provided that the error distribution (unimodal) can be iden- 


tified from the data and the type interaction can be charac- 


Aba ae A hs Na pec 9 aOR SA EME 


terized. Twenty-eight and one half minutes on an IBM 360 
computer were required on the average in this study for ob- 


taining power estimates for the normal case and one non- 


ALEC eke reins alah 


normal error distribution. 


C. AREAS FOR FURTHER STUDY 
1. Power Efficiency 
A typical question facing a prospective user of the 
Wilson test is that of how many replications are necessary 
with a particular desi: 2 to attain a desired power, given 
a particular error distribution based on preliminary sampling. 
This is a problem in estimating power efficiency for the 
Wilson test and is a natural extension of this study. The 
same simulation model could be used. ; oH 
| 2. Different Type of interaction i j 
The present research could be extended to include 
other characterizations of interaction. In the course of 
this study the author at one stage characterized interaction 
by (a8); 5 = a; Cs, + bj. Incomplete results, based on only 
200 batches of data, indicated that the estimated power for 
the F tests was the same as that for the characterization 
stated herein (i, 6. (a8) 55 = a; ¢3), but that the cstimated 
power for tne Wilson Chi-square tests was considerably dif- 
ferent from that under the present characterization. 


3. Effect of Homoscadacity on Power 


It is belicved that a computer simulation method, | 


gf 4 
ai 


similar te the one used here relative to non-normality, could: 
34 
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be used to study the effect on power of the parametric ANOV 


requirement of homoscadacity. Error distributions could be 


Sta a a ea re 


generated with different degrees of error variance, data 


tested by the ANOV and WILSON subroutines, and the results 


analyzed. 


4. Semi-Nonparametric Tests 


Another research area recomuended is that of "'semi- 


Sh cea a OL ale ce ea 
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nonparametric" tests. An appropriate thesis might consist 
of extracting all such tests from the literature and then 


showing why they are semi-nonparametric with resultiny im- 


RE Bro Set EE IN eR 


plications. Reference 15 would be a good starting point 


for such a thesis. 


5. Effect of Sample Size on Non-normality Effect 
A final extension of this study might be to analyze 


the effect of sample size (replications per cell) on the power 


LEAR Oe SS il drag e go et ee add ney sods 


o the ANOV F and Wilson Chi-square tests under the same con- | 
ditions of error non-normality. Would a doubling or tripling 
of sample size tend to flatten out the Wilson power curves 


shown in Figures 1, 2, and 3 of Section V or would tney just il 


be shifted upwards? Would the ANOV F tests still be unaf- 
fected by crror non-normality if the sample size were only 


two replications per cell? 
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VII. SUMMARY 


A complete simulation method has been presented for 
estimating power for both parametric ANOV tests and Wilson 
non-parametric ANOV tests. A 3x4 ANOV simulation model 
was used with six replications per cell. Interaction was 
characterized by (8) 45 = 04 Cy with c,'s constant, in- 
creasing, evenly spaced, and £ c; = Q. Error non-normality 
was characterized in for degrees of three unimodal error 
distribution types: skewed, leptokurtic, and platykurtic. 
It was shown that when the error was normal the Wilson Chi- 
square test for A effect was 58 per cent of that for the 
comparable F test, the Chi-square test for B effect was 77 
per cent of that for the F test, und the Chi-square Lesi 
for AxB effect was 38 per cent of that for the comparable 
F test. It was further shown that the Wilson Chi-square 
tests were not distribution free as claimed by Wiison but 
were sensitive to the parent drseqibdtion Shape. Lepto- 
kurtic and skewed error Jistributions increased the power 
of the Chi-square tests above that estimated with normal 
error. Platykurtic error distributions decreased the power 
of the Chi-square tests from that estimated with normal 


error. The power of the ANOV F tests were unaffected by 


even the extreme cases of error non-normaiity. 
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APPENDIX A: DESCRIPTION OF THE WILSON TEST 


1. The median value, My for the entire set of n ob- 
servations is determined. The number of observations less 
than M,, represented by n,» is then calculated. 
| 2. <A 2xrxe contingency table is constructed where r 


if 


and c represent the number of rows and columns of the de- 
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sign and the "third dimension 2" corresponds to the divi- 
sion of scores by Mg: The frequency entries for the 


contingency table are represented by fi? the number of 


merge ea RE Le tT EN 


observations less than My for the cell in row i and column 
j of the table. It follows that 


n= yo rae 


rn +3 ° { 
“ 2 . v 13 | 


ij 
The below contingency table example is given for the purpose 


of clarifying notation: 


cote ye near pen aire cE ne orn mente ee 


Ma = 100.54851 
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FACTOR A 
1 2 3 4 bij. cg 
5 6 2 2 15 
FACTOR 
B 2 3 2 4 2 11 
3 4 4 1 1 10 
pf.j 22 12 7 5 36 
7 7 all 
n., = 6 
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r= 3 
c= 4 
b*ij = # observations < Ma 
ny = 36 
n= 72. 


3. Since the number of observations for cach cell in 
the ANOY design of interest are all equal and M, can be cal- 
culated such that ag n/2, the total Chi-square value can 


be computed as follows: 


AO R)EE bts rel oa 
where n/2rc represents the expected frequency under the null 
hypothesis that the main effects and interaction effects 
produce no change in the distribution of scores. XT has 
(rc-1) degrees of freedom. 

4. The Chi-square values of the row effects and column 


effects are computed using the marginal totals of the 2xrxc 


contingency table. 


XR * | x [of : | (1-3) 
and 
a(S] E hes - 28} aw 


; : 
where pfy. i pt 


,2 & : 
ij and vf .j F pfij: As before, the expec 


ted frequencies for the main effects, n/2r and n/2c, are ob- 


tained for the null hypotheses that the distributions of 


scores are identical for all levels of the row or column 
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effects. XR and Xe have (r-1) and (c-1) degrees of freedom 
respectively. 
5. The Chi-square value for the interaction effect is 


computed by subtraction. 


XE = Xp ° XR 7 XE (1-5) 


or has (r-1) (c-1) degrees of freedom. 

6. The tests for the main effects and interaction are 
made by comparing the obtained values of Xpe Xoo and Xi with 
values from the cumulative Chi-square distribution for the 
appropriate degrees of freedom and desired significance 
level. 

7. Wilson [6], citing Rao (1952) and Cochran (1954) 
concluded that there was ne problem concerning small ex- 
pected frequencies as long as the contingency table has 30 
or more degrees of freedom. He further concluded that 
ordinary Chi-square tables were applicable as long as the 
30 degrees of freedom criteria held. 

8. Wilson (6) included formulation for Chi-square 
values when n> the number of observations greater than or 
equal to My is not equal Ny and when the Nij are not ali 
equal. In addition he extended application of his test to 


experimental designs with other than two factors. 
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APPENDIX B: DEVELOPMENT OF 12 CELL MODELS 


1. The Mathematical Model 


Yigk "Bt Oy * By * COB) Gs * Zi5K 


i=1,..,3; j#1,..,43 k#1,..,6 


is expressed in matrix notation as 


x Bo. > Z 


~< 
It 


too} | bw 
See es a, 
1001 a | 
i100 | | Ai 
ie 4 , B2 
1100 | Bs 
ae aS 
ene ‘ ’ 2 
100 | oi, 

ee a. ts oat | 
1100 i r 261s : 
sare aga | 
1010 | | aby 
ee oe 
lole | i 
pore. | 
ioié | : 
boo! | : 
1010 | | [ 
ae 
1001 | i 7 
poet | Wy 
loot | i i 
A ea i. 
1ooi | i i 
1001 | | i. 
bot | 

(72x1) (72x20) (20x1) = (72x11) 
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2. Model conditions are: 


= 0 


wD 
Pa 
+ 
wD 
nN 
+ 
WD 
w 
H 
oO 


CoB)y 4 + (aB)12 + (08), 
(a8) 31 * (GB) 22% (a8) 95 
(OB) 5, + (aB),5 + (08) 2, 
CaP) ig: AOR g a ABs 
(OB) + (a8)5, + (a8) 5, 
(a8) ,, + (0B) >, + (0B) 55 


(aB)iq + (0B) 24 + (a8) 5, 


a = “Oy be O49 
By = “By = Bo - Bs 


(OB) 44 = ~CaB)a4 - (08) 


ae Mala i a 
(OB) 4 . ~ (a8) 44 - (a8) 44 


(0B) 9 = (0B), 7 (0B) 55 


(a8) 55 = -(08))5 = (af) 55 


(a8) 5, = ~(08),, = (a8) 5, 
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+ 


+ 


oa 
jo4 
RD 
— 
N 
& 
fi 


cp 


> (06)) 5 


(08) 55 


(2-3) 
(2-4) 
(2-5) 
(2-6) 
(2-7) 
(2-8) 
(2-9) 


(2-16) 


3. Rewriting the aoove and eliminating duplication: 


(2-11) 
(2-12) 
(2-13) 
(2-14) 
(2-15) 
(2-16) 


(2-17) 


(2-18) 


| 
i 
| 
F 


Thus the matrix equation can be reduced to an equation in 


only 12 parameters (vice 29). 
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4, The new new matrix equation is expressed on page 


The following cell models are obtained by expanding the 


new matrix equation. 


Model for 


Yuk 7 Ut oy 


Model for 


ek 
Model for 


13k 7 HF %y 
Model for 


< 
ti 
zc 


a ar + a, 
are 4 


BED ae 
Model for 
Your 7 Ut Oy 
Model for 


York = Ht % 


Model for 


Your 7 Bt G2 


‘Model for 
Moa i ag 
= COB) 98 

| Model for 


Vark "HF 


- *3ik 


cell one: 

+ By + (OB) 44 + ~41k madre yO 
cell two: 

+ Bo + (aB) 15 + 210% k=1,¢°,6. 
cell three: 

+B, + (a8) 13 + Zip k=1,°°,6. 
cell four: 


ba B, = B. * B, - (oB),, 


a (a8) 13 + 214k ; k=1,¢+,6, 

cell five: . 

= By + (a8) 94 * 201k k=1,°°,6. 

cell six: 

+ B, + (a8) 59 + 202k k=1,°*,6. . 
cell seven: 


+ By + (o8)>3 * 223k: k=1,++,6. 

cell eight: 3 | 

“ By - 8, - 8. (08) 2) | 

~ (08) 25 * Zeax a aa 

cell nine: 

= a, + By 7. (0B) yy (aB) >) . 
ae k*1,°°,6. 
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(2-19) 


(2-20) 


(2-21) 


(2-22) 
(2-23) 
(7-24) 


(2-25). 


(2-26) 


(2-27) 


(72x1) 


(12x1) 


(72x12) 
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ao ae a 

* 232k 
Model for 


Yaa, 7 Ho7 ay 


TS 5ak 
Model for 


Yoqn 7 U7 Oy 
* (a8), 


* (08) 


celi ten: 


cell eleven: 


cell twelve: 


" Go > By 


7) + Bo * (08) 55 rs (a8) 55 


k=1,°°,6. (2-28) 


2 + 8. = (ab). . (08) >. 


* k=1,°°,6. (2-29) 


- Bo - Bz + (oB)ay 


PCBOV ge CERI ae © 20h G3 


+ 


*34k 
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k=1,°*,6. (2-30) 
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APPENDIX D: DEVELOPMENT OF MODEL PARAMETERS 


1. The power curve for determining theoretical power 
for the test of interaction, 3x4 ANOV model, is attached 


‘o this appendix. Numerator degrees of freedom are 6, 


(r-1)(c-1),. and denominator degrees of freedom are 60, 
rc(n-1); where r is the number of rows (3), c is the nun- 
ber of columns (4), and n is the number of replications 
per cell (6). For a desired power of 0.67 at a level of 
Significance of 0.05, ¢ = 1.3. 


2. From Reference 11 


Cc 


r 
CAxB © z ooo % y. | = (1.3)? 


zrl jr71. 4 


1.69. (4-1) 


Since a variance of 3.98 was actually obtained in simulation 
of the N(0,4) error, this value was substituted for o?. 
Values for r, c, and n stated above were also substituted. 


Then, rearranging (4-1) 


3 4 
y y. (a8)? = 1E3.98)(1-69) 2 7, 9387, (4-2) 


-3. Any combination of (a8) 55's such that (4-2) holds 


will give the desired power. But the interaction was char- 


Sel ea aemeceeb teachin tac 


acterized by (a8) 55." aic; with c;'s evenly spaced, constant, 
increasing with By and & cos 0, Initixsi values for C were 


also arbitrarily chosen as c, * -3, Co = l, Cc," 1, and cy * 
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cit actcan iio heen 


3. Initial values for a, were also arbitrarily chosen, 


subject to the condition y Os = 0, to be a, = 71.03, 


a. = 0.28, a, = 0.75. 
4. Substituting these values for Ot and c, 
YY 845° EY coe; = ¢¢-2-03)¢-3)) 
; oe : | 
+ee4 ((0.75)(3) = 34.0360. 


Thus each value of the initial selection for Cc; was too 
large by the multiple (34.0360/7.8387)* = 2.0837. Initial 


c5's were then divided by 2.0837 to obtain: 


= -1.4 971 = -1.44 


ie Vigne oe £2 
c, = -@.47990 - -9.48 


2 
cz = 0.47990 = 0.48 
Cy = 1.43971 = 1.44, 


Ser ARC REennng. era nsgemptengninateT ER HUTS OE RT Wty EFFORT, 1A PUR EET LINER ONT tare re ENE TIME a a eae eis a aaalatias 
: ° (SREB ORES EL A OE TE TT RL OTe EN ea ee ee Selita Bi 


5. Using the original a,'s and the adjusted c,'8, 


values of (08) 55 = (a5¢;) were calculated to be 


(0B) 44 = 0)C) 1.4832 


e 


(a8), * a,c. ¥ 0.4944 | | q 
(a8))5 ° acs * 0.4944 
(aB) 4" a,c, = -1,4832 
(08), = a,c, * -0.4032 
(08). 7 apc, * 70.1344 
(a8) 93 * ancy = 0.1544 
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(a8), = 4c, = 0.4032 


(a8) 5, = OnCy -1,.0800 


(08) 34 = a,C, 1,0800. 


These values of (a8) 55 met the model conditions stated in 
Appendix B (i.e., = (a8)., = 0, i=1,++,3, and = (aR).. = 0, 

j 1) i 1) 
j=l,e*,4). 

6. Values of B, were then chosen arbitrarily subject 
to the condition j B = 0. Values were selected in the 
neighborhood of the a; values so that no single main ef- 
fact would ba dominant. Vatucs chosen were 0, = -1.9%, 

4 
B. = =1.03;; ea 1.03, 8, = 1,03. 
7. The overall mean, represented by yw in equations 


3) through (14), was chosen arbitrarily to be 100.00. 
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APPENDIX E: COMPUTER FLOW DIAGRAMS 


1. Main Program 


INITIALIZE 
PARAMETERS 


* MAKE 72 
ERROR VARIATES 


[ 

| MAKE 6 DIFFERENT OBS. 
| FOR EACH CELL USING 
: 


RAS ak RNR AN a a Me Thay acta hg He ten onkiet ce ctson wt ue 
~ ee 
a tielitada ois. ‘nia 


ae 


EQNS (3) - (14) 


CALL SUBROUTINE ANOV ~~ — 


aS es RETURN 
/ ARE FA, ‘ 
fis, FAXG | ‘Se 


DON'T 
COUNT 


>THRESHOLD VALUES 


| cour 


POWER FA = TOTAL COUNT FA/N 
POWER FB = TOTAL COUNT FB/N 
POWER FAxB = TOTAL COUNT FAxB/N 


he Fak tt Ee SIE RNR AO Nat Sint hc Se sce he Oe, ABS Rie: ehh fb Ried et 
es . EE ie . : 7 
iii te ainsi sania: 


H 

: 4 
i 

: a4 
| if 
f ~ —— ee ~ RETURN 3 q 


ARE y2A, 
x?B, x?AxB 
2 THRESHOLD VALUES 


wie eA aden 


DON'T 
COUNT 


$2 


Ln SO ARR RTT 


reer 


a TO 


fe NE ERR ERP RTE IE ET ETS AERP ETD 


CPST ENE ES OO ANGI POLIS ROUEN cae no MEAL RR 


cence ementee rs ae st ette 


et Peers png er som 


POWER x2A = TOTAL COUNT x2A/N 
POWER x2B = TOTAL COUNT x?B/N 
POWER x2AxB = TOTAL COUNT y?AxB/N 


N_TIMES 


NOTES: 


*This package was changed for each level of each type 
error. 


Selected computer programs used are attached foilow- 
ing appendices. 


2. Subroutine ANOV 


LOAD 72 GBS IN 18x4 
MATRIX. MATRIX ROWS 

1-6 EQUIV TO DESIGN 
ROW 1, ETC. 


CALCULATE SUM OF EACH 
OBSERVATION SQUARED 


ave 


CALCULATE SUM OF EACH 

OF 4 DESIGN COLUMNS 
AND ITS SQUARE. 

TDIN(J), TNIDSQ(J) 


CALCULATE SUM OF: 
EACH OF 12 DESIGN-CELLS © 
AND ITS SQUARE 


$3 


| - YSQ. (I,-!) ; | ge * eee 
: fe eC TIJD(J), TIIDSQ(J = 4 
| ) | : | SY 
Ce pe ee 


CALCULATE SUM OF EACH 
OF 3 DESIGN ROWS 
AND ITS SQUARE 


. TIDD(1), TIDDSQ(I) 


COMPUTE OVERALL SUM 
OF OBS. SQUARE IT. DIVIDE 
BY 72 TO GET CORRECTION 
FACTOR 
COR. = TDDDSQ/72 


“aren Seiwa resachers tabetha seen 46e5 Stan Atelad sctabtbimehaans AAS E INS athe. wibchcanetnin teh sbeestaine Sic atay: eens sithid cee 80 Gf 
= i SY fe Te rae siti eget xs 


COMPUTE TOTAL SUM OF SQUARES 
TSS = £ 2 YSQ(I,J) - COR 
ij 


COMPUTE B SUM OF SQUARES 
BSS = i TDIDSQ(j)/18 - COR | 
J =S : é 


COMPUTE A SUM OF SQUARES 
ASS = = TIDD(I)/24 - COR 


i=] 


COMFUTE AxB SUM OF SQUARES. 
axBss = }* TrypsQ(M)/6 - ASS - BSS - COR 
“m= ) ; Boars 


COMPUTE ERROR SUM OF SQUARES |! 
ESS = TSS - (ASS + BSS + AxBSS) | 


Cee tee i eS ee oer ere Ceo RRR LTR Gr em EE: AE EO EE II ST 


wee eda set 


Lo pA SRN fe ITY PEE ETT PLETE NT RR ET SET EFT ETN ES TS - 
- hs ts Pik US Se Ns Oe 3 Dak ex inthis Stes MBs ae By ARNS getadlbgstrnn nis Dae aw lant 


COMPUTE A MEAN SQUARE 
AMS = ASS/2 


COMPUTE B MEAN SQUARE 
BMS = BSS/3 


COMPUTE AxB MEAN SQUARE 
AxBMS = AxBSS/6 


COMPUTE ERROR MEAN SQUARE 
oMS = ESS/60 


COMPUTE F RATIOS 
AMS/EMS 
BMS/EMS 
AxBMS/EMS 


RETURN 


o6 Subroutine Wilson 


aga Capi aan | 
ORDER .72 OBS IN 
COLU:IN VECTOR 


{FIND MEDIAN OF 72 088 | 
XMED = (YOBS(36) + YOBS(37))/2 


IN EACH CELE OF 2x3x4 
CONTINGENCY TABLE 
1) .LT.XNED)NBSUM # NBSUM+1 


$$ 


FIND FREQUENCY OF OBS < MEDIAN | 


Pes eng ee a : i : Soe ae es te ae ce ettihet ao mannan dobonee abana talent tee oe re eats 
PITT NST, gta TRIED Se OO OS ORT NE NRCS TUN OMNI Fe NS TD Dercieer erator Taeiraeer cr OrTeoe eS Ne 


ee 


FIND CONTINGENCY TABLE ROW SUMS | 


| NBFID = a NBF(1,J) I=1,..3 


FIND CONTINGENCY TABLE COLUMN SUMS 
NBFID = # NBRF(I,J) J=1,..4 
I=1 


FIND TOTA!, CHI-SQUARE VALUE 


eT = G) (3) (4) rE (NBF(I,J) - OLOIGe 


| FIND CHI-SQUARE A VALUE 


Y 
j FIND CHI-SQUARE B VALUE | 


a 


FIND CHI-SQUARE AxB VALUE 
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APPENDIX F: DETERMINING TEST REPLICATIONS 


1. Ten ANOV and ten WILSON subroutines were executed 
on data generated with the simulation model, N(0,4) error. 
Power of cach of the F and Chi-square statistical tests 
for main effects and interaction was conputed by the method 
shown in Appendir E. The subroutines were then executed 
ten more times and power computed in a similar manner. This 
was repeated until 200 ANOV's and WILSON subroutines had 
been executed consecutively giving a sample size of 20 power 
values per test, each power value based on the number cf 
false null hypotheses rejected out of ten tricls. 

2. Sample variance was then computed for each test. 


6G. =X)? , (6-1) 


26 
i 


ee 
eT aST 


Q 


i=] 
where X; is the ith nower value and X is sample mean power 
value based on the 20 Xi 's. n. = 20 | 

3. Using the student's t distribution table, the 


value for k was determined so that the following equation 


held. - 


Prob | = < - i [e deo (6-2) 


2vk 7 nyo} 2vk 

where ec = .04, not = 19, and l-a = .95. 
4. For each of the six statistical tests involved Ny 
the pumber of replications required for desired confidence 


on the value of X, was determined by: 
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et aca SRE ee ce tn neem came, 


Ny = Max (ei + 1, n) (6-3) | 


where [S?/k] represents the greatest integer < the ratio. 
5. The value obtained for Ny was then multiplied by 
ten to obtain N, the number of WILSON and ANOV subroutines 


to be executed for desired confidence, The multiplication 


was necessary since ten replications were required for each 
value of X,. 


6. The maximum of the N values for the six tests in- 


volved was then determined and rounded up to the nearest 


100. Results are summarized below: 


N,ANOV = Max (1560, 520, 1670) = 1670 (6-4) 


N,WILSON ~ Max (3010, 2$39, 1966) - 3616 (6-5) 
N= 3700. (6-6) 


7. Determining test was the Chi-square test for A ef- 
fect. The computer program utilized is attached following 


the appendices. 


APPENDIX G: THEORETICAL POWER, ANOV MAIN EFFECTS TEST 


A cere te eee Oe A ET TRENT staat 


1. Reference 11 gives 
r 
nc DS as 
ae oe (7-1) 
o*r 
Here n = number of replications, 6; c = number of design 


columns, 4; r = numbe 


variance, 4; and — ag 


2 


r of design rows, 3; o° = desired 


sum of the squared a, parameters 


i 
from Appendix D. Substituting 

62 = {sO 7018 er 
and 

6, = 1.5. 


The applicable nower 
numerator degrees of 
degrees of freedom = 
read from the curve, 
Significance. 


P. 


= 
= 


Here £ b 


sum of the squared By*s from Appendix PD 


curve, attriched to this apne dix, has 


2, and denominator 


freedom = r-l = ? 


> 


rc(n-1) = 60. Power of the test, as 


is 0.81 at the S per cent level of 


) 
Substituting 
oz = (6) (3) (12456) 2g 2g 
: (4) (4) 
and 
op e 2.18. 
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The applicalle power curve, with numerator degrees of free- 


fini siete oiled 


dem = 3 and deonominator degrees of freedom = 60, gives 


power of this test = 0.96. The power curve is attached. 
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